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In analogy with its classical counterpart, a noisy quantum channel is characterized by a loss, a 
quantity that depends on the channel input and the quantum operation performed by the channel. 
The loss reflects the transmission quality: if the loss is zero, quantum information can be perfectly 
transmitted at a rate measured by the quantum source entropy. By using block coding based on 
sequences of n entangled symbols, the average loss (defined as the overall loss of the joint n-symbol 
channel divided by n, when n — > oo) can be made lower than the loss for a single use of the channel. 
In this context, we examine several upper bounds on the rate at which quantum information can 
be transmitted reliably via a noisy channel, that is, with an asymptotically vanishing average loss 
while the one-symbol loss of the channel is non-zero. These bounds on the channel capacity rely on 
the entropic Singleton bound on quantum error-correcting codes [Phys. Rev. A 56, 1721 (1997)]. 
Finally, we analyze the Singleton bounds when the noisy quantum channel is supplemented with a 
classical auxiliary channel. 



PACS numbers: 03.65.Bz, 03.67.Hk, 89.70.+C 



I. INTRODUCTION 

Within recent years, the quantum theory of informa- 
tion and communication has undergone a dramatic evolu- 
tion (see, e.g., Q]). Major progress has been made toward 
the extension to the quantum regime of the classical the- 
ory of information pioneered by Shannon Q . In particu- 
lar, the use of quantum communication channels in order 
to transmit not only classical information but also intact 
quantum states (or quantum information) has received a 
considerable amount of attention, following the proof of 
the quantum analog of Shannon's fundamental theorem 
for noiseless coding by Schumacher S . It has been shown 
that the von Neumann entropy plays the role of a quan- 
tum information-theoretic entropy in the sense that it 
characterizes the minimum amount of quantum resources 
(e.g., number of quantum bits) that is necessary to code 
an ensemble of quantum states with an asymptotically 
vanishing distortion in the absence of noise. This result 
suggests that a general quantum theory of information, 
paralleling Shannon theory, can be developed based on 
this concept. While such a full theory does not exist as of 
yet, a great deal of effort has been devoted to this issue 
over the last few years, and several fundamental results 
have been obtained, ranging from entanglement-based 
communication schemes m to quantum error-correcting 
codes |^| . In particular, a substantial amount of work has 
been devoted recently to the transmission of arbitrary 
states (or quantum information) through noisy quantum 
channels (see, e.g., [^|-^)). A quantum state processed by 
such a channel undergoes decoherence by interacting with 
an external system or environment, which effects an alter- 
ation of quantum information. A natural question that 
arises in this context concerns the possibility of transmit- 



KRL preprint MAP-215 

ting quantum information reliably, in spite of quantum 
noise, if it is suitably encoded as sequences of quantum 
bits in analogy with the standard construction used for 
classical channels. More specifically, a fundamental is- 
sue is to understand the quantum analog of Shannon's 
noisy channel coding theorem and to define the capacity 
of a noisy quantum channel, i.e., an upper limit to the 
amount of quantum information that can be processed 
with an arbitrarily high fidelity. While several attempts 
have been made to define a quantum analog of Shannon 
mutual information that would be a natural candidate 
for such a quantum measure of capacity (see the concepts 
of coherent information |?],|8| or von Neumann mutual en- 
tropy [PUlCjl ) , the problem of characterizing in general the 
capacity of a noisy quantum channel is still unsolved. 

The purpose of this paper is to further clarify the de- 
scription of noisy quantum channels centered on the von 
Neumann mutual entropy (see ||). It has been shown re- 
cently that a consistent information-theoretic framework 
that closely parallels Shannon's construction can be de- 
veloped, based on von Neumann conditional and mutual 
entropies [p^4j3[. The central peculiarity of this frame- 
work is that it involves negative conditional entropies 
in order to account for quantum non-local correlations 
between entangled variables. This is in contrast with 
Shannon information theory in which marginal and con- 
ditional entropies are all non-negative quantities. Neg- 
ative quantum conditional entropies simply reflect the 
non-monotonicity of the von Neumann entropy p"4| ] (the 
entropy of a composite system can be lower than that of 
its components if the latter are entangled). The resulting 
information-theoretic formalism provides grounds for the 
quantum extension of the usual algebraic relations be- 
tween Shannon entropies in multipartite systems [pT[-fi3[ . 
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Surprisingly, many concepts of Shannon theory can be 
straightforwardly translated to the quantum regime by 
extending the range for quantum (conditional and mu- 
tual) entropies with respect to the classical one in or- 
der to encompass entanglement pJ. This is very help- 
ful in analyzing quantum information processes in a uni- 
fied framework, paralleling Shannon theory. For exam- 
ple, entanglement-based quantum communication pro- 
cesses [ [To] , quantum channels ||, and quantum error- 
correcting codes |D| can be described along these lines. 

In this paper, we focus on the application of this 
information-theoretic framework to the issue of finding 
upper bounds on the capacity of quantum codes and 
quantum channels. In Section II, we outline the general 
treatment of noisy quantum channels based on quantum 
entropies j^] , and extend it to the characterization of con- 
secutive uses of a quantum memoryless channel (cf. the 
notions of one-symbol and average loss explained in Sec- 
tion IID). This provides a simple framework to consider 
block coding with quantum channels. Note that, just as 
in Shannon information theory, quantum entropic consid- 
erations alone do not result in constructive methods for 
building codes. Rather, they are useful to derive bounds 
on what can possibly be achieved or not, from basic prin- 
ciples. Accordingly, we analyze in Section III several 
upper bounds (based on the Singleton bound on quan- 
tum codes [^5|) for standard quantum channels such as 
the quantum erasure or depolarizing channel. This con- 
firms bounds on the quantum capacity that were derived 
otherwise, but places this problem in a unified context. 
Finally, we examine in Section IV the extension of this 
quantum entropic treatment of noisy quantum channels 
to the case where an auxiliary classical channel is avail- 
able. Quantum teleportation appears then as a special 
case of this construction when no block coding is applied. 

II. ENTROPIC CHARACTERIZATION OF 
NOISY QUANTUM CHANNELS 

A. Notations 

Let us start by summarizing the basic definitions that 
will be useful in the rest of this paper when consider- 
ing noisy quantum channels. The entropy of a quantum 
system X (of arbitrary dimension) is defined as the von 
Neumann entropy of the density operator px that char- 
acterizes the state of X, i.e., 

S(X) = S[ PX ] = -Tt( Px log 2 p x ) (2.1) 

It can be viewed as the uncertainty about X in the sense 
that it measures (asymptotically) the minimum number 
of quantum bits (qubits) necessary to specify X || . This 
definition can be extended to the notions of conditional 
and mutual von Neumann entropies, based on a simple 



parallel with their classical counterparts which is moti- 
vated in |l(]-[l^]. For a bipartite system XY character- 
ized by pxy, the conditional von Neumann entropy is 

S(X\Y) = S(XY) - S(Y) (2.2) 

while the mutual von Neumann entropy is 

S(X:Y) = S{X)-S(X\Y) 
= S(Y) - S(Y\X) 

= S(X) + S(Y)-S(XY) (2.3) 

where S(XY) is calculated from pxy, while S(X) and 
S(Y) are obtained from the reduced density operators 
px = T^y(pxy) and py = Ttx{pxy)- Subadditiv- 
ity of quantum entropies implies S(X:Y) > 0, where 
the equality holds if X and Y are independent (i.e., 
Pxy — Px &> Py)- Note that, when S(XY) = (i.e., the 
joint system XY is in a pure state), we have S(X:Y) = 
2S(X) = 2S(Y) as a consequence of the Schmidt decom- 
position. This property will be useful in the following. 
Several quantum entropies can also be defined for char- 
acterizing multipartite quantum systems. Consider, for 
instance, a tripartite system XYZ. The von Neumann 
conditional mutual entropy (of X and Y , conditionally 
on Z) can be defined as 

S(X:Y\Z) = S{X\Z) - S(X\YZ) 

= S(X\Z) + S(Y\Z) - S{XY\Z) 

= S(XZ) + S(YZ) - S(Z) - S{XYZ) (2.4) 

in perfect analogy with the classical expressions. Note 
that the strong subadditivity of quantum entropies im- 
plies S(X:Y\Z) > 0. We can also define the von 
Neumann ternary mutual entropy as 

S{X:Y:Z) = S(X:Y) - S(X:Y\Z) (2.5) 

Note that, if S(XYZ) = (i.e., the ternary system is in 
a pure state), then S(X:Y:Z) = [Q, or, equivalently, 
S(X:Y) = S(X:Y\Z), a property which is very useful in 
the analysis of quantum channels. Also, chain rules for 
quantum entropies can be written, such as 

S{X:YZ) = S(X:Y) + S(X:Z\Y) (2.6) 

which parallel the classical relations [Q. The motiva- 
tion for building such a quantum entropic framework is 
that it provides an information-theoretic formulation of 
quantum entanglement in multipartite systems, unified 
with Shannon's description of classical correlation. It is 
an extension of Shannon's formalism beyond its original 
range, as reflected for example by the fact that the quan- 
tum mutual entropy can reach twice the maximum value 
allowed for classical entropies ]1Q] , that is, 

< S(X:Y) < 2 mm[S(X),S(Y)} (2.7) 

This factor 2 appears in many quantum information- 
theoretic relations (see below), and originates from the 
Araki-Lieb inequality for quantum entropies [|L0|— pT2[| . 
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B. Quantum mutual entropy, loss, and noise 

Let us now outline the entropic treatment of a noisy 
quantum channel (see also Ref. 0). Such a treatment 
explicitly displays the correspondence with the standard 
description of noisy classical channels (see Appendix A), 
thereby unifying classical and quantum channels. Our 
description involves three quantum systems of arbitrary 
dimensions: Q (the quantum system whose processing 
by the channel is concerned), R (a "reference" system 
which Q is initially entangled with), and E (an exter- 
nal system or environment which Q is interacting with 
in the noisy channel). More specifically, we assume that 
Q is initially entangled with R. so that the joint state 
of Q and R is the pure state \^rq). We may as well 
regard Q as a quantum source, being initially in a mixed 
state pq (realized by a given ensemble of quantum states 
associated with some probability distribution). The "pu- 
rification" of pq into |$ rq) can always achieved by ex- 
tending the Hilbert space Hq to Hrq, so that we have 
Pq = T^r(\^rq)(^rq\)- The corresponding reduced von 
Neumann entropies are 



S(R) = S{Q) = S 



(2.8) 



where S is called the source entropy. In the dual picture 
where an arbitrary pure state of Q (rather than entan- 
glement) is sent through the channel, S then measures 
the "arbitrariness" of Q (it can be viewed as the aver- 
age number of quantum bits that are to be processed 
by the channel in order to transmit the state of Q). In 
what follows, we prefer to consider a quantum input Q 
that is entangled with R, so that the preservation of 
entanglement — rather than of arbitrary states — will be 
the central feature of a quantum transmission channel. 
The initial mutual entropy to be transmitted is thus 



S{R:Q) = 2S 



(2.9) 



that is, twice 1 the source entropy. 

When it is processed by the channel, Q interacts with 
E (assumed to be initially in a pure state |0)) according to 
the unitary transformation Uqe, inducing decoherence. 
This describes the most general (trace-preserving) oper- 
ation of a quantum channel that is allowed by quantum 
mechanics. Roughly speaking, the resulting noisy quan- 
tum channel is such that, typically, only a fraction of the 
initial entanglement with R can be recovered after having 
been processed by the channel (the rest of the entangle- 
ment with R is lost, in the sense that it is transferred to 



the environment). More specifically, the decohered quan- 
tum system after interaction with E, denoted as Q' , is in 
the state 



p' Q = Tr E (Uqe(pq®\0)(0\PI 



QE 



(2.10) 



where pq is the initial state of Q (with source entropy 
S). The completely positive linear map pq — > p'q cor- 
responds to the "quantum operation" performed by the 
noisy channel 0. After such an environment-induced 
decoherence, the joint system R'Q'E' is in the state 
\® R'Q'E') = (1r <8> Uqe)\Vrq)\0e) whose entropy Venn 
diagram is represented in Fig. |l| (the primes refer to the 
systems after decoherence). Note that, as the reference 
is not involved in decoherence, we have R 1 = R. 

FIG. 1. Schematic representation of the quantum opera- 
tion effected by a noisy quantum channel. The quantum sys- 
tem Q is initially entangled with the reference R, with a mu- 
tual entropy of twice the source entropy S (this is indicated 
by a dashed line). Then Q decoheres by interacting with an 
environment E (initially in a pure state |0)). The entropy 
Venn diagram summarizes the entropic relations between Q' 
(output of the quantum channel), R' (reference), and E' (en- 
vironment) after decoherence. The three parameters, L, 
and N, denote the von Neumann mutual entropy (quantum 
information), the loss, and the noise, respectively. 



2S 



Q' 




The entropy diagram of R'Q'E' depends on three pa- 
rameters, the von Neumann mutual entropy (or the quan- 
tum information) /, the loss L, and the noise TV, these 
quantities being defined in analogy with their classical 
counterparts: 



/ = S{R:Q') 
L = S(R:E'\Q') 
N = S(Q':E'\R) 



S(R:E') 
S(Q':E') 



(2.11) 
(2.12) 
(2.13) 



The classical correspondence can be made fully explicit 
by including an environment in the description of a clas- 
sical channel, as shown in 0. The second equality in 



1 Note that this factor two reflects a fundamental difference 
between classical and quantum channels (see Appendix A for 
comparison). Such a factor is omnipresent in the quantum 
information-theoretic relations between entropies |12]. 
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Eqs. ( 2.12Q and ( 2.13| ) has no classical analog, and re- 
sults from the vanishing of the ternary mutual entropy 
S(R:Q':E') (see Physically, the quantum infor- 

mation / corresponds to the residual mutual entropy be- 
tween the decohered quantum output Q' and the refer- 
ence system R that purifies the quantum input Q. The 
loss L is the mutual entropy that has arisen between the 
environment after decoherence E' and the reference sys- 
tem R, while the noise N is the mutual entropy between 
the decohered quantum output Q' and the environment 
E' , Note that /, L, and N can be written as a function of 
reduced entropies only, without explicitly involving the 
environment E in the discussion, by making use of the 
Schmidt decomposition of the state of R'Q'E\ namely 
S(E') = S(RQ'): 



I = S(Q) + S(Q') - S(RQ') 
L = S(Q) + S{RQ')-S(Q') 
N = S(Q') + S(RQ') - S(Q) 



(2.14) 
(2.15) 
(2.16) 



It can also be shown that these three quantities are in 
fact independent of the choice of the reference system R 
whenever the latter purifies the quantum input Q, so that 
they provide a most concise entropic characterization of 
information flow in the channel. They depend in gen- 
eral on the channel input (i.e., pq) and on the quantum 
operation performed by the channel (i.e., the completely 
positive trace-preserving map on Q that is specified by 
Uqe in the joint space of Q and E). This exactly paral- 
lels the situation for the analog classical quantities. The 
information /, loss L, and noise N of a classical channel 
of input X and output Y (see Appendix A) indeed de- 
pend on the input distribution p(x) and on the channel 
"operation" characterized by p{y\x). 

Among these three quantities, only / and L are rele- 
vant as far as (forward) information transmission through 
the channel is concerned (the noise N plays a role in the 
description of the "reverse" channel, just as for classical 
channels). Indeed, information processing is character- 
ized by the balance between the von Neumann mutual 
entropy and the loss, these two quantities always sum- 
ming to twice the source entropy: 



I + L = 2S(Q) = 2S 



(2.17) 



The mutual entropy / = S(R:Q') represents the amount 
of the initial mutual entropy with respect to R (i.e., 2S) 
that has been processed by the channel, while the loss 
L = S(R:E') corresponds to the fraction of it that is 
unavoidably lost in the environment. If the channel is 
lossless (L — 0), then / = 2S, so that the interaction 
with the environment can be perfectly "undone" , and 
the initial entanglement of Q can be fully recovered by 
an appropriate decoding |7||1. (Equivalently, this means 
that an arbitrary initial state of Q can be recovered with- 
out error.) This can be understood by noting that R does 



not become entangled directly with the environment in a 
lossless channel, but only via the output Q' (see Fig. |l| 
when L = 0). An operation on Q' only (namely, the de- 
coding operation) is enough to transfer the entanglement 
with E' (measured by the noise N) to an ancilla, while 
preserving the entanglement 2S with R. 

Thus, if L = 0, a perfect transmission of informa- 
tion (including quantum information) can be achieved 
through the channel by applying an appropriate decod- 
ing. When 1 = 0, on the other hand, no information 
at all (classical or quantum) can be processed by the 
channel. This is the case, for example, of the quantum 
depolarizing channel with p = 3/4 (see Section HID). In 
between these limiting cases, classical information (and, 
up to some restricted extent, quantum information) can 
be reliably transmitted at the expense of a decrease in 
the rate by making use of block coding. The analysis of 
such a transmission of quantum information immune to 
noise is the main focus of this paper. 

For completeness, let us mention that a channel with 
N = is the quantum analog of a deterministic chan- 
nel [ fus] , that is, a channel where the input fully deter- 
mines the output (see Appendix A) . The quantum output 
Q' is indeed not directly entangled with E' but only via 
-R, which implies that its entanglement with R remains 
intact (see Fig. [j] when N = 0). This does not mean, 
however, that perfect error correction is achievable, as 
an operation on the reference R is needed to recover the 
initial entanglement 2S between Q and R. A channel 
which is both lossless (L = 0) and deterministic (N = 0) 
is called noiseless; its action on Q is the identity operator 
(or any fixed unitary operator). For example, the overall 
channel including a noisy quantum channel along with 
the encoder and decoder is obviously noiseless if perfect 
error correction is achieved. (In other words, the decoder 
is used to eliminate the quantum noise N ^ by trans- 
ferring the entanglement with E to an ancilla, which then 
makes the overall channel noiseless provided that L = 0.) 
It is worth noting here that the noise N and the loss 
L play symmetric roles when considering the "reverse" 
channel obtained by interchanging the input and output. 
(This is true for classical channels as well.) More specif- 
ically, N and / always sum to twice the output entropy, 



N = 2S{Q') 



(2.18) 



in analogy with Eq. ( 2.17 ). Roughly speaking, N plays 
the role of the loss of the reverse channel, as shown in 
Sec. IIC. 



C. Properties of quantum /, L, and N 

The above entropies for a noisy quantum channel can 
be shown to fulfill several properties, akin to classical 
ones, which make them reasonable quantum measures of 
information, loss, or noise (see also Ref. M). First, the 
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quantum mutual entropy I can be shown to be concave 
in the input pq for a fixed channel, i.e., a fixed quantum 
operation pq — > pq> or a fixed Uqe- Therefore, any lo- 
cal maximum of I is the absolute maximum, that is, the 
von Neumann capacity of the channel. This parallels the 
concavity of the Shannon mutual entropy H(X:Y) in the 
input probability distribution p(x) for a fixed channel, 
i.e., fixed p{y\x) [ p~7[ . Second, I is convex in the output 
Pqi for a fixed input pq . This property will be used in the 
next Section when considering a "probabilistic" channel 
(the effective channel resulting from the probabilistic use 
of a family of channels) . It is the quantum analog of the 
property that the information H(X:Y) processed by a 
classical channel is a convex function oip(y\x) for a fixed 
p(x) Jl?]]. These two properties are simple to prove by 
reexpressing the von Neumann mutual entropy / as 

S(R:Q') = S{Q'E') + S{Q') - S(E') 

= S(Q') + S(Q'\E') (2.19) 



or as 



S(R:Q') = S(R) + S(Q') - S{RQ') 
= S(R)-S(R\Q') 



(2.20) 



If the input pq is a convex combination of density opera- 
tors while the channel is fixed, it is easy to see that pqe 
and therefore pq> e' are also convex combinations (as the 
channel operation is linear). Since the conditional en- 
tropy S(Q'\E') is concave in a convex combination of 
Pq'E' while S(Q') is concave in pq> Q, Eq. ( 2. IE ) im- 
plies the concavity of the quantum mutual entropy I in 
the input for a fixed channel. The second property can 
be proven the same way by noting that, if we have a 
"probabilistic" channel — a convex combination of quan- 
tum channels — acting on a fixed input, then prq> is a 
convex combination of density operators while pn is con- 
stant. Thus, Eq. (2.20) together with the concavity of 
the conditional entropy S(R\Q') in a convex combina- 
tion of prq' implies that the quantum mutual entropy / 
is convex in the output for a fixed input. 

A third important property is that the mutual entropy 
I and the quantum loss L are subadditive when consid- 
ering a channel made of several independent quantum 
channels used in parallel. This will be shown when ana- 
lyzing quantum block coding (cf. Sect. IID). Finally, it 
can be proved that I obeys (forward and reverse) data- 
processing inequalities when considering chained quan- 
tum channels. If we chain two channels by using the 
output of the first as an input for the second (see Fig. ||), 
the total (1+2) channel pq — + pq> — > pqh is characterized 
by 



7 12 = S(R:Q") 
L 12 = S(R:E'E") 
N u = S{Q":E'E") 



(2.21) 
(2.22) 
(2.23) 



since we can regard the two environments E' and E" as 
a global environment for this total channel . 

FIG. 2. Schematic view of the chaining of two noisy quan- 
tum channels. In each of them, the input state decoheres by 
interacting with a (separate) environment. The input of the 
first channel is initially entangled with R, with a source en- 
tropy of S (see the dashed line). The output of this channel 
Q' is then used as an input for the second channel. Since Q' 
is purified by RE' (not by R alone), the "reference" system 
that must be considered in the entropic characterization of 
the second channel is RE' . 
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Using the chain rule for quantum mutual entropies 
S(R:E'E") = S(R:E') + S(R:E"\E'), and remembering 
that S(R:E"\E') > as a result of strong subadditivity, 
we obtain 



< L x < L u 



(2.24) 



where L\ = S(R:E') is the loss of the first channel while 
L12 is the loss of the total channel. Thus, the loss can 
only increase by further processing of quantum informa- 
tion in the second channel. Since I\ + Lx = 2S(Q) and 
/12+L12 = 2S(Q), we obtain the forward data-processing 
inequality 



1x2 <h< 25(g) 



(2.25) 



implying that the mutual entropy of the total channel 
cannot exceed the one of the first channel. This is the 
quantum analog of H(X:Z) < H{X:Y) < H(X) for 
chained classical channels X — > Y — > Z jlTj . 

Now, if we use the chain rule S{Q":E'E") = 
S{Q":E") + S{Q":E'\E") together with strong subad- 
ditivity, we obtain 



< N 2 < Nx2 



(2.26) 



where N 2 — S(Q":E") is the noise of the second chan- 
nel while N12 is the noise of the total channel. As 
h + N 2 = 2S(Q") and J 12 + N 12 = 2S(Q"), we obtain 
the reverse data-processing inequality 



Ii2 <h< 2S(Q") 



(2.27) 



where I 2 = S(RE':Q") is the mutual entropy processed 
by the second channel. (Note that the "reference" sys- 
tem that purifies the input Q' of the second channel is 
RE'.) This parallels the classical inequality H(X:Z) < 
H(Y:Z) < H{Z) for chained channels 0. Eqs. ( jgjzg ) 
and (2.26) emphasize that the loss L and the noise N 
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play a symmetric role in this entropic description if one 
interchanges the input and the output of the quantum 
channel ("time-reversal"), just as for classical channels. 
This is reflected by the symmetry between the forward 
and the reverse data-processing inequalities. 



D. One-symbol loss and average loss 

The central idea of classical error correction by block 
coding is to introduce correlations between the bits that 
make a block, in order to have redundancy in the trans- 
mitted flow of data. This can make the transmission 
asymptotically immune to errors, up to some level of 
noise. In quantum error-correcting codes, the qubits that 
form a block are entangled in a specific way, so that a par- 
tial alteration due to decoherence can be recovered 0. 
Even though entanglement gives rise to some qualita- 
tively new features (see |l5| for a detailed analysis), the 
objective is similar. Namely, when block coding is used, 
i.e., when say k "logical" qubits are encoded into blocks 
of n "physical" qubits, it is possible to achieve a situation 
where the overall loss of the joint (n-bit) channel is arbi- 
trarily small, while the loss for individual qubits (for each 
use of the channel) is finite. In analogy with the classi- 
cal construction, if blocks of n qubits that are initially 
entangled with respect to R (with a mutual entropy 2k) 
can be transmitted through the channel with an asymp- 
totically vanishing overall loss, we say that the channel 
processes 2k /n bits of entanglement per qubit. Equiva- 
lently, the channel is transmitting at a rate R — k/n (on 
average, k arbitrary binary quantum states can be trans- 
mitted for n transmitted qubits). The maximum rate at 
which quantum information can be reliably sent through 
the noisy channel is defined as the quantum channel ca- 
pacity. (This maximum has to be taken over all possible 
coding schemes, and for n — > oo.) Whether a good (and 
operational) definition of such a "purely quantum" chan- 
nel capacity exists is currently an open question. In the 
following, we restrict ourselves to the issue of finding up- 
per bounds on the rate of perfect quantum information 
transmission (and therefore on such a "purely quantum" 
capacity) . 

Let us consider the asymptotic use of a quantum dis- 
crete memoryless channel, where n (tending to infin- 
ity) qubits are transmitted sequentially. 2 Each qubit 
may decohere due to an environment (quantum noise), 
the exact interaction depending on the considered noise 



model. The important point is that the environment for 
each qubit is initially independent of the one interact- 
ing with every other qubit. Thus the information pro- 
cess can be viewed as n sequential uses of a quantum 
memoryless channel (the environment being "reset" af- 
ter each use) or, equivalently, as n parallel independent 
channels processing one qubit each (see Fig. ||). We as- 
sume that the set of n input symbols (Qi, ■ ■ ■ Q n ) are 
initially entangled with R, so that S(R:Qi ■ ■ ■ Q n ) = 2S 
and S(R) = S(Qi ■ ■ ■ Q n ) = S. If we consider these 
n symbols as the single input of a joint n-bit channel 
Qi ■ ■ ■ Qn — * Qi ■ ■ ■ Q'n, information transmission is de- 
scribed by the mutual entropy 

/ = S(R:Q[ ■ ■ ■ Q' n ) 
= S{Q l ■ ■ ■ Q n ) + S(Q[ ■ ■ ■ Q' n ) - S(E[ ■ ■ ■ E' n ) 
= S(Q[ ■ ■ ■ Q' n \E[ ■ ■ ■ E'J + S(Q[ ■ ■ ■ Q'J (2.28) 



and the loss 

L = S(R:E[...E' n ) 
= S(Q 1 ■■■Q n ) + S(E[ 
= S(E' 1 ...E' n \Q' 1 ...Q' v 



E' n )-S(Q' 1 ---Q' n ) 
±S(E[...E' n ) (2.29) 



where we have made use of the conservation of entropy 
imposed by the unitarity of the global interaction with 
the n environments E\, ...E n . Obviously, we have 
I+L = 2S(Qi ■ ■ ■ Q n ) = 2S(R), which is twice the source 
entropy S of the joint channel. 

FIG. 3. Schematic view of a memoryless quantum channel. 
This channel is used n times, but the environment is "reset" 
after each use. This can be viewed as n parallel (independent) 
channels, each one being used for one of the input symbols. 
The n input symbols (Qi, Q2, ■ ■ ■ Qn) are initially entangled 
with R (as indicated by a dashed line), with a joint source 
entropy of S. 



Q,- 



— 



2 Throughout this paper, we use indistinctly the terms qubit 
or symbol to denote the quantum state that is sent in a sin- 
gle use of the channel. As a matter of fact, the reasoning is 
totally general, and applies to quantum states (or symbols) 
in a Hilbert space of arbitrary dimension > 2. 
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Each individual channel Qi — > Q\ can be described in 
the same way, noting that each Qi interacts with an envi- 
ronment Ei (initially in a pure state |0)) which results in 
Qi and E[. The only difference here is that R has to be 
supplemented with all the input Q's except Qi in order to 
purify Qi. Thus, the mutual entropy characterizing the 
i-th channel is 

h = S{RQ 1 ■ ■ ■ Qi-xQi+i ■ ■ ■ Q n :Q'i) (2.30) 

while the corresponding loss is 

U = S{RQ X ■ ■ ■ Qi-iQi+i ■ ■ ■ Qn.E'i) (2.31) 

These quantities can be re-expressed by using the fact 
that the environments Ex, ... E n are initially in a prod- 
uct state and that the Qi and Ei interact pairwise. We 
have 



and 



h = S(Qi) + S(Q<) - S{E'i) 
= S(Q' l \E' l ) + S(Q , l ) 



U = S(Qi) + S(El) - S(Q'i) 
= S(E>\Qi i ) + S(E>) 



(2.32) 



(2.33) 



For each channel, the loss and the mutual entropy sum to 
twice the source entropy of the channel (tracing over all 
the other channels): Ii + Li — 2S(Qi). The subadditivity 
of von Neumann entropies, 



S(E[---E' n ) < S(E[) 



(2.34) 



and the subadditivity of von Neumann conditional en- 
tropies 



S(E[---E' n \Q' 1 ---Q' n )<S(E' 1 \Q' 1 ) 



S{E' n \Q' n ) 
(2.35) 



combined with Eqs. (2.29) and (2.33) imply that the loss 
is subadditive: 



L<Lx 



(2.36) 



The same reasoning can be made using Eqs. ( gjgh and 
( |2.32|) and interchanging the Q'^s and £^"s in Eqs. ( 2.34j ) 
and (2.35), which results in the equivalent expression for 
mutual entropies: 



In 



(2.37) 



The latter inequality corresponds to the subadditivity of 
the von Neumann mutual entropy for parallel channels. 
Finally, using the relation between the loss and the mu- 
tual entropy for individual channels and for the joint 
channel, we obtain 



Li 



L n -2M <L< Li 



Lr. 



(2.38) 



with M = S{Q X )+- ■ ■+S(Q n )-S(Q 1 ■ ■ ■ Q n ) > 0. Equiv- 
alently, if we define the average loss 3 of the joint n-bit 
channel as I = L/n for n — * oo, we see that 



W - 2m < I < h 



(2.39) 



where m = M/n and l\ = {L\ + ■ • • + L n )/n is the 
one-symbol loss, i.e., the loss for a single use of an in- 
divid ual ch ann el ave raged over all 1-bit channels. Thus, 
Eqs. ( 2.38 ) or ( 2.39 ) imply that the loss cannot increase 
by using block coding (using parallel channels). It typi- 
cally decreases by an amount which is bounded by 2M (or 
2m), a quantity related to the entanglement between the 
input symbols. (Note that M = if the input symbols 
are independent.) The analog construction for a classi- 
cal channel is presented in Appendix A in order to clarify 
the straightforward classical to quantum correspondence. 
As an example, let us consider the use of blocks of two 
qubits. Assume also that the two 1-bit channels are iden- 
tical, i.e., L\ = L-i = l\. The average loss of the joint 
2-bit channel, I — L/2, can be bounded by 



h -5(Qi:Q 2 ) < I < h 



(2.40) 



This explicitly shows that block coding can decrease the 
average loss only when the symbols are entangled [i.e., 
S(Qv.Q 2 ) > 0]. 



Equations (2.38) or (2.39) allow us to derive a simple 



upper bound on the maximum achievable rate by block 
coding, as a function of the one-symbol loss (or mutual 
entropy) for a single use of the channel. Indeed, only if 
the lower bound on L (or I) extends to zero (that is, if 

2M > Li-\ \-L n ) is it possible that block coding makes 

the joint channel perfectly immune to noise while each 1- 
bit channel has a non-vanishing loss. Thus, we have the 
necessary condition for having a vanishing average loss 
(1 = 0): 



25(Qi • • • Q n ) < 2S{Qx) + ■■■ + 2S(Q n ) 
—L\ — ■ ■ ■ — L n 



(2.41) 



3 The average loss I reflects the effective loss per qubit pro- 
cessed in the noisy channel, that is, the loss affecting the 
overall process (encoding + joint channel viewed as n paral- 
lel one-bit noisy channels) divided by the number of physical 
qubits n when n — > oo. 
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As a consequence, the rate of quantum information trans- 
mission through the joint channel, R = S(Qi ■ ■ ■ Q n )/n>, 
is bounded from above by half the averaged one-symbol 
mutual entropy for individual channels: 



R < 



h 



2n 



(2.42) 



Thus, the (averaged) mutual von Neumann entropy char- 
acterizing each use of the channel provides an upper 
bound on the achievable rate of transmission by block 
coding through the noisy channel. Except for the fac- 
tor 1/2, this inequality parallels the one for a classical 
channel (see Appendix A). Remember that the quantum 
capacity of a channel is defined as the maximum rate 
that can be achieved through the channel (over all possi- 
ble input and coding schemes) with a fidelity arbitrarily 
close to one. The classical analogy suggests then that the 
(maximum) one-symbol von Neumann mutual entropy 
yields the quantum capacity. However, this upper bound 
appears not to be attainable in general (see, e.g., M), in 
contrast with the equivalent classical bound. (The physi- 
cal meaning of the von Neumann mutual entropy is better 
understood in the context of noisy superdense coding, as 
shown in ||.) Therefore, it is necessary to derive more 
constraining entropic upper bounds on R, which is the 
main concern of the rest of this paper. 

In the next Section, we build on the entropic derivation 
of the Singleton bound for quantum codes presented in 
Ref. [[HI , and extend it to the treatment of noisy quantum 
channels in order to find better upper bounds on the rate 
of perfect quantum information transmission (and there- 
fore on the "purely quantum" capacity). The bounds 
that we derive can be attained in some cases (e.g., for 
the quantum erasure channel), or not in other cases (e.g., 
for the quantum depolarizing channel). It is unknown 
whether such a purely entropic approach unifying clas- 
sical and quantum channels can possibly yield the best 
(asymptotically attainable) upper bound, just as it is the 
case for classical channels, but this is not out of the ques- 
tion. This will be further investigated in future work. 



III. ENTROPIC BOUNDS ON CODES AND 
CHANNEL CAPACITIES 



In this Section, we derive several bounds, either on 
quantum codes or on quantum channel capacities, using 
an entropic approach based on the Singleton bound (see 
Ref. p|). 



A. Quantum channel subject to a p-bounded fraction 
of erasures 



We say that a quantum channel is subjected to a p- 
bounded fraction of erasures |Q8[ if, among n uses of the 
channel, a fraction of pn qubits^ (at most) are erased (or 
replaced by a distinguishable third state, e.g., |2)). When 
considering erasures (rather than errors), the important 
point is that it is possible to perform an incomplete mea- 
surement of each qubit at the output of the channel, to 
check whether it is in the |2) (erasure) state, or in the 
subspace spanned by |0) and |1), without destroying su- 
perpositions in the latter subspace [fl9f| . In this error 
model, transmission through the channel is considered 
successful if an arbitrary initial quantum state can be 
perfectly recovered (or the entanglement with R can be 
maintained) , which can obviously be achieved if one uses 
a quantum e-erasure correcting code with e = pn, that is, 
a code that allows any pattern of e qubits of each code- 
word to be erased. The rate (i.e., the average number of 
logical qubits transmitted with arbitrarily high fidelity 
per physical qubit) of a channel subjected to a p-bounded 
fraction of error is thus equivalent to the rate of a ((n, k)) 
quantum code correcting e = pn erasures. The rate of a 
((n, k)) code, i.e., a code mapping k logical qubits into 
codewords of n qubits, is defined as R = k/n. Conse- 
quently, an upper bound on the rate of quantum codes 
is simply equivalent to an upper bound on the rate of 
a channel with this particular error model (or an upper 
bound on the capacity, which is the highest achievable 
rate through the channel). When considering a channel, 
k is simply the source entropy S of the joint channel (i.e., 
the number of arbitrary qubits that are sent). 

It is known that an upper bound on the Hamming 
distance of nondegenerate quantum codes with fixed n 
and k can be derived from "sphere-packing" considera- 
tions |^(|. However, as a bound on the rate (or capac- 
ity) of a quantum channel involves a maximization over 
all coding schemes, including those based on degener- 
ate codes (which have been shown to exceed the Ham- 
ming bound plj|), only the bounds which are valid for 
all quantum codes are applicable to channels. As proven 



in a previous paper |15|, an upper bound on the rate of 
(nondegenerate and degenerate) quantum codes can be 
derived using entropic considerations only. This is the 
quantum Singleton bound: k < n — 2e (see also p2[). 
Translated in the channel language, this implies that an 
upper bound on the rate (and therefore the capacity) of 
a lossless (L — 0) channel subjected to a p-fraction of 
erasures is 



4 When n — > oo, the number of erasures pn can be considered 
as an integer without loss of generality. 
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R < 1 - 2p 



(3.1) 



S(R) < S(Q*) <n-2e 



(3.6) 



For completeness, we summarize the proof of the Sin- 
gleton bound given in jjq| . The basic idea of the proof 
will be useful in the following, when considering other 
channels. 

FIG. 4. Schematic representation of two possible partitions 
of Q into an erased piece Q e (or Q' e ) and unerased piece C_J it 
(or Q' u ). The "overlap" between the unerased pieces in both 
partitions is denoted by Q* . The entropic erasure-correction 
condition for the first partition is S(R:Q e ) = 0, while the 
condition for the second one is S(R:Q' e ) = 0. 
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As pictured in Fig. |J, for each pattern of erased qubits 
Q e , the entropic condition S(R:Q e ) — must be fulfilled, 
so that the unerased qubits Q u emerge from a lossless 
channel. This implies that the full entanglement of the 
codeword Q (with respect to R) must be "concentrated" 
in the unerased qubits Q u : 



S(R:Q U ) = S{R:Q) = 2S(R) 



(3.2) 



Using the fact that the joint state of RQ u Q e is pure, 
along with the subadditivity of quantum entropies, we 
have 



S(R:Q U ) = S(R) + S(Q U ) - S{RQ U ) 
= S(R) + S(Q' e Q*)-S(Q e ) 
< S{R) + S(Q' e )+S(Q*)-S(Q e ) 



(3.3) 



where we have divided the unerased qubits Q u into an- 
other pattern of e qubits, Q' e , and the remaining piece 
of n — 2e qubits, Q* . Eqs. (3.2) and (3.3) provide the 
inequality 



S(R) - S(Q*) < S(Q' e ) - S(Q e ) 



(3.4) 



Since this reasoning is symmetric in Q u vs. Q' u , the in- 
equality corresponding to the division of Q' u into Q e and 



S(R) - S(Q*) < S(Q e ) - S(Q' e ) 



(3.5) 



must also be satisfied. Combining these two inequalities 
yields the condition 



where the upper bound on S(Q*) simply results from the 
dimension of the Hilbert space of Q* . Since the encoding 
of a fc-qubit arbitrary state requires that S(R) = S(Q) = 
k, the above condition implies the quantum Singleton 
bound 



k < 



2e 



(3.7) 



The physical meaning of the Singleton bound is that, in 
order to have Q e independent of R and, at the same time, 
Q' u = Q e Q* fully entangled with R, a minimum Hilbert 
space for Q* (minimum number of qubits) is necessary 
in order to accommodate the source entropy k. 



B. Quantum channel subject to a p-bounded fraction 
of errors 

Another possible error model for a quantum channel is 
the case where a fraction of p qubits (at most) are altered 
by interacting with an environment. The difference with 
the previous error model is that the location of the er- 
rors is unknown (by contrast with erasures), i.e., there is 
no "flag" indicating which are the qubits that have been 
altered. 

Since a channel where the fraction of errors is bounded 
can be made lossless {L = 0) using an error-correcting 
code (just as for erasures in the previous Section), it 
is enough to use the correspondence between error- 
correcting codes and erasure-correcting codes to derive 
an upper bound on the rate of perfect transmission of 
quantum information in such a channel. In analogy with 
the classical situation, one can show that any code that 
corrects t errors is also able to correct up to e = 2t era- 
sures (23). This enables us to reuse the result of the 
previous Section simply by replacing p by 2p. Thus, we 
obtain the Singleton upper bound on the rate (or capac- 
ity) of a lossless (L = 0) channel with a fixed fraction p 
of errors: 



R < 1 - 4p 



(3.8) 



(This bound, or rather the fact that the rate of a code 
is vanishing at p = 1/4, is originally due to Knill and 
Laflamme 22 ). To our knowledge, the only stronger 
bounds on R for quantum codes that have been displayed 
for some special cases are: 



R < H (1/2 + y/2p(l - 2p)j for additive (or sta- 
bilizer) codes, where H stands for the dyadic Shan- 
non entropy Jl8[ | . This bound is based on an upper 
bound on classical linear codes. 5 



5 Some slightly stronger bounds have been recently obtained 
in Ref. Q. 
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• R = for p > 1/6 for general nondegenerate 
codes §§. 

It is worthwhile looking for improvement of Eq. ( [3.§| ) us- 
ing an entropic approach as presented above. 



C. Quantum erasure channel 

We now consider a quantum erasure channel with era- 
sure probability p (see, e.g., |l^]). In such a channel, each 
transmitted qubit has a probability p of being erased (and 
detectable at the output as an "erased" qubit). We are 
interested in the maximum rate of quantum information 
transmission that can be achieved by this channel. More 
precisely, our aim is to derive an upper bound on this rate 
using an entropic approach. It will appear that such an 
entropic bound exactly coincides with the capacity of a 
quantum erasure channel recently displayed in Ref. fl9[| . 

The central point of the reasoning is to describe the 
joint "probabilistic" channel (with n qubits at input and 
output) as a superposition of "binomial" n-bit channels 
(defined below). This allows us to make use of the con- 
vexity of the von Neumann mutual entropy in the output 
of a channel for a fixed input (see Sect. IIC) in order 
to derive an upper bound on R. We consider a fam- 
ily of "binomial" n-bit channels C (labeled by the index 
c), each characterized by a pattern of e erased qubits, 
Q e (c), and the complementary pattern of n — e unerased 
qubits, Q u (c). The probabilistic channel of interest here 
corresponds to a probabilistic use of these channels with 
a binomial distribution. More precisely, each channel c 
with e erased qubits is associated with a probability (or 
weight) w c — p e (l — p) n ~ e , and there are obviously (") 
distinct channels with e erased qubits. The superposition 
means, physically, that the resulting probabilistic joint 
channel consists in using one of these 2™ distinct "bi- 
nomial" n-bit channels with the appropriate probability. 
Thus, for a given n-bit input of the channel, say p, the 
output can thus be written as a convex combination 



E 



WcPc 



with 



(3.9) 



where w c is the weight of the c-th (binomial) channel in 
the superposition, and p' c is the output for that channel 
c. The convexity of the von Neumann mutual entropy in 
the output (for a given input) for the overall probabilistic 
channel implies that 



S(R:Q') <J2 w cS(R:Q' c ) 



(3.10) 



where / = S(R:Q') is the quantum mutual entropy be- 
tween the reference R and the output Q' of the joint 
(n-bit) quantum erasure channel, while I c = S(R:Q' C ) 



stands for the mutual entropy of the output of the c- 
th channel, Q' c , with respect to R. Note that, we have 
S(R:Q' C ) = S(R:Q u (c)) since only the unerased qubits 
of the c-th channel contribute to mutual entanglement 
with R (the erased qubits are independent of R). As ex- 
plained in Sect. IIC, the mutual von Neumann entropy 
S{R:Q') = S(R) - S(R\Q') is convex in the output (for 
a fixed input) because the conditional entropy S(R\Q') 
is concave in a convex combination of prq> [i.e., a con- 
vex combination of quantum channels acting on a fixed 
input, as shown in Eq. ( |3.9| )]. 

It is convenient to group the channels C into several 
classes, a ccord ing to the number of erased qubits, e. Us- 
ing Eq. ( 3.10| ), we can then write an upper bound on 
the processed quantum mutual information in the joint 
(n-bit) erasure channel of probability p as 



m < £(i - P ) n - e p e J2 s(r--Qu(c)) 



(3.11) 



where the sum over c spans the (™) channels where e 
qubits are erased. Before deriving a simpler expression 
of this upper bound using the Singleton bound, let us 
show that a simple relation between I(p) and 1(1 — p) 
can be obtained from Eq. ( |3.1l| ). First, note that 

S(R:Q' C ) = S(R:Q u (c)) 

= S(R) + S(Q u (c))-S(Q e (c)) 

= 2S(R) - S(R:Q e (c)) (3.12) 

where we have used the fa ct th at RQ u Q e is in a pure state 
for each channel C. Eq. ( 3.11 ) can then be rewritten as 



I(p) < 2S(R) - £(1 - p) n - e p e S(R:Q e (c)) (3.13) 

e=0 



The second term in the right-hand side of Eq. ( 3.13| ) can 
be interpreted as an upper bound on 1(1 — p), i.e., the 
information processed through a "dual" erasure channel 
of probability 1 — p where the erased qubits are replaced 
by unerased qubits and conversely. As a consequence, re- 
membering that S(R) = k if an arbitrary fc-bit quantum 
state is sent in the channel (i.e., the source entropy of the 
n-bit channel is k bits), we have 



I(p)+I(\-p) < 2k 



(3.14) 



for all n. Using I(p) + L(p) = 2k, the corresponding rela- 
tion for the quantum losses of the dual (with probability 
p and 1 — p) erasure channels is 



L(p) + L(l-p) > 2k 



(3.15) 



(Remember that < L(p) < 2k.) This implies that a 
necessary condition for having a perfect channel at prob- 
ability p, i.e., L(p) = 0, is that L(l — p) — 2k, i.e., that 
the dual channel at probability 1 — p is "fully erasing" 
(no information at all — either classical or quantum — is 
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processed through it). Another way of expressing this 
condition is by writing a lower bound on the loss of the 
erasing channel 



L(p)>I(l-p) 



(3.16) 



Only if no information is transmitted through the erasure 
channel of probability 1— p, or if 1(1— p) = 0, is it possible 
that the loss of the erasure channel of probability p van- 
ishes . Th is is obviously compatible with L(0) = 1(1) = 0. 
Eq. ( 3.15|) also implies that £(1/2) > k, so that the quan- 
tum erasure channel with probability 1/2 cannot be loss- 
less for a non-zero source entropy. (The fact that it actu- 
ally has a vanishing capacity — or a maximum loss — will 
be shown below. This result can also be derived from an 
argument based on the impossibility of cloning, as shown 
in Ref. ||.) 

Let us now derive a general expression for an upper 
bound on the mutual entropy of the n-bit channel (or, 
equivalently, a lower bound on the overall loss). Using 
Eqs. (|3.1lD and (|3.12|), we have 



I(p)<S(R) + Y,(l-pT- e P e 

e=0 

x [s(Qu(c)) - S(Q e (c)) 



(3.17) 



We first rewrite Eq. ( 3.17 ) as a summation up to n/2 (we 
assume here that n is even), by combining each channel 
c in this sum with its dual channel where erased qubits 
are unerased and conversely: 

n/2 

I(p) < S(R) +J2[(1- P) n - e p e - (1 - p) e P n - e 



[s(Qu(c)) - S(Q e (c)) 



(3.18) 



(Note that the term with e = n/2 is vanishing.) We now 
follow the reasoning that we used earlier to derive the 
Singleton bound, and group the channels in pairs (c and 
c') which "overlap" in n — 2e qubits denoted by Q* (see 
Fig. ^). Thus, for a given value of e, we have to calculate 
the sum of terms [S(Q u (c)) — S(Q e (c))] for channels c 
and c'. We have 

S(Q U ) - S(Q e ) + S(Q' U ) - S(Q' e ) 

= S(Q' e Q*) - S(Q e ) + S{Q e Q*) - S(Q' e ) 



< 2S(Q*) 

< 2(n-2e) 



(3.19) 



where the last inequality reflects the limitation on S(Q*) 
imposed by the dimension of the Hilbert space of Q*. 
Thus each pair of terms in the summation over c can be 
bounded from above by 2(n — 2e), so that a simple cal- 
culation yields an upper bound on the mutual entropy of 
the joint (n-bit) channel 



e=0 



I(p) < S(R) P (1- p) n - e P e (n - 2e) 



<k + n(l - 2p) 



(3.20) 



or a lower bound on the overall loss of the joint channel 
L(p) > k + n(2p - 1) (3.21) 

This results in a lower bound on the average loss (per 
qubit) for a quantum erasure channel of probability p as 
a function of the rate R = k/n, 



Hp) 



> R + 2p - 1 



(3.22) 



(Note that this is only a lower bound on the loss, which 
is, by definition, a non-negative quantity.) This inequal- 
ity is consistent with l(p) + 1(1 -p) > 2R [cf. Eq. (|3.15|) 1- 
It implies that a vanishing average loss (i.e., the reliable 
transmission of information through the n-bit probabilis- 
tic channel) is only possible if the rate 



R < 1 - 2p 



(3.23) 



Thus, a quantum erasure channel with p — 1/2 (i.e., if 
the channel is erasing 50% of the qubits) has a zero ca- 
pacity. Eq. ( 3.23| ) confirms the linear interpolation 6 be- 
tween the 50%-erasure channel (for which the capacity is 
zero) and a noiseless channel (for which the capacity is 
one) that was used in Refs. pj|Jl9]|, In addition, since it 
is shown in Refs. |3 19| that this upper bound coincides 
with a lower bound obtained from one-way random hash 



coding, Eq. ( p.23[ ) therefore describes the exact capacity 
of the quantum erasure channel C = 1 — 2p jL9[ . 



In Ref. |J, it is shown that the capacity of a composite 
channel (which is a convex combination of a perfect and an 
imperfect quantum channel) cannot exceed the appropriately 
averaged capacity of these two component channels. In other 
words, the quantum capacity cannot be superadditive when 
"mixing" a perfect channel with an imperfect one. 
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D. Quantum depolarizing channel 

We now consider a quantum depolarizing channel with 
error probability p. In this channel, each qubit interacts 
with the environment such that it undergoes a bit-flip 
(a x rotation), a phase- flip (a z rotation), or the combina- 
tion of both {a y rotation) with probability p/3 each. In 
Sees. Ill A and IIIB, we have seen that the simple con- 
nection between quantum error- and erasure-correcting 
codes provides a trivial relation between the resulting 
upper bounds on the rate of channels subjected to a 
bounded fraction of errors or erasures. Unfortunately, 
there is no such simple relationship when comparing the 
quantum erasure and depolarizing channels. As a mat- 
ter o f fa ct, the upper bound on error-correcting codes, 
Eq. (3JS), is not immediately applicable to the quantum 
depolarizing channel. Such a situation results from the 
fact that the definition of an error-correcting code re- 
quires that all error patterns (of at most t = pn errors) 
are perfectly corrected, while a rate will be said to be at- 
tainable through a channel (i.e., it is below the quantum 
channel capacity) whenever the fraction of uncorrected 
error patterns is asymptotically vanishing (for n — » oo). 
Still, the reasoning used to calculate the Singleton bound 
on the quantum erasure channel is applicable to the quan- 
tum depolarizing channel as well. 

Assume that an individual qubit, Q, is initially entan- 
gled with the reference, so that RQ is in the state \^rq) 
(for example the singlet state). After processing by the 
channel, the system RQ' is in a Werner state |2(| of "en- 
tanglement fidelity" F = 1 — p, that is, the mixed state: 



PRQ> = (l-4p/3)|tfflQ>(tfflQ| 



(3.24) 



In other words, the qubit emerges at the output of the 
channel either in a random state (having totally lost the 
entanglement with R) with probability 4p/3, or in its 
intact original state (fully entangled with R) with prob- 
ability 1 - 4p/3. When p = 3/4, the channel is 100% 
depolarizing, i.e., its output is random. 

In the joint n-bit channel, each qubit undergoes the 
above evolution independently of the other ones. As be- 
fore, the resulting n-bit probabilistic channel can be de- 
scribed as a superposition of binomial channels, in which 
each qubit is either kept unchanged or "randomized" . 
The distribution of the underlying channels is thus a bi- 
nomial one, just as in the previous Section, the only dif- 
ference being that p is replaced here by 4p/3. The entire 
calculation of the previous Section can then be repeated, 
because the "randomization" or "erasure" of a qubit are 
equivalent as far at the mutual entropy with R is con- 
cerned. Indeed, for a channel c, we have 



where Q u (c) correspond to qubits that are not random- 
ized (rather than not erased) in channel c. This is obvious 
because the randomized qubits [in state ( 1 0) (0 1 + 1 1) ( 1 1 ) / 2] 
are independent of i?, just as the erased qubits (in state 
|2». 



First, in analogy with Eq. ( 3.15| ), we have 

L{p) + £(3/4 - p) > 2k (3.26) 



implying that £(3/8) > k, so that the 
izing channel with probability p = 3/8 
(in fact, it has a vanishing capacity, 
Equivalently, we have L(p) > 1(3/4 - 
the p = 3/4 channel cannot transmit 
turn information, i.e., £(3/4) = £(0) = 
Singleton lower bound on the average 



quantum depolar- 
cannot be lossless 
as shown below). 
- p) , showing that 
classical or quan- 
= 0. The resulting 
loss (per qubit) is 



lip) 



HP) 



>R + 8p/3 - 1 



(3.27) 



in analogy with Eq. ( [3.22] ). Consequently, the quantum 
depolarizing channel can have a vanishing average loss 
(i.e., allow an asymptotically reliable transmission of in- 
formation by using blocks of n qubits) at the condition 
that 



R < 1 - 8p/3 



(3.28) 



as originally shown in Ref. Thus the quantum de- 
polarizing channel with p = 3/8 has a zero capacity for 
the transmission of quantum information. (Note that 
such a channel corresponds in fact to a 50%-depolarizing 
channel, where 50% of the qubits are replaced by a ran- 
dom qubit. This channel can obviously not have a non- 
zero capacity, as a consequence of the no-cloning theo- 
rem ||.) As for the quantum erasure channel, a linear 
interpolation between the perfect channel and the 50%- 
depolarizing channel can be used (and also results inde- 
pendently from the our reasoning). Note that Eq. ( 3.2S ) 
yields only an upper bound on the capacity of the quan- 
tum depolarizing channel, which is provably not achiev- 
able (in contrast with the equivalent bound for the quan- 
tum erasure channel). Indeed, a tighter bound for the de- 
polarizing channel has been obtained very recently pTj] , 



R < 1 - 4p 



(3.29) 



Ic = S(R:Q' C ) = SiR:Q u ic)) 



(3.25) 



which is based on the Buzek-Hillery universal cloning ma- 
chine p8|. While Eq. Q3.29] ) happens to be equivalent 
to Eq. (3.8), there appears to be no direct relation be- 
tween them. A simple intuitive reason why this bound 
is stronger than Eq. Q3.28 ) can be understood by realiz- 
ing that the two quantum channels underlying the uni- 
versal cloning machine (from the single input to both 
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outputs) cannot be described classically. 7 Indeed, when 
tracing over one of the outputs of the universal cloning 
machine, the other output appears to emerge from a 
p = 1/4 (or F = 3/4) channel, i.e., a 33%-depolarizing 
channel. This looks as if the qubit was sent with proba- 
bility 2/3 to each output of the cloning machine, which 
is obviously not understandable in classical terms. Only 
a quantum superposition, involving the cloning machine 
and both outputs, can account for this situation and re- 
sults in a stronger upper bound ]27j]. The information- 
theoretic analysis of quantum cloning (and related en- 
tropic bounds) will be the subject of further investiga- 
tion. 

To our knowledge, the only stronger upper bound on 
the capacity (for a restricted range of p values) of the 
depolarizing channel is R < 1 — H (p), based on a connec- 
tion between quantum additive (or stabilizer) codes and 
classical linear codes [ pj| . 

IV. QUANTUM CHANNEL WITH AN 
AUXILIARY CLASSICAL CHANNEL. 

In this Section, we consider a quantum channel which 
is supplemented with a classical channel (assumed to be 
noiseless and of unlimited classical capacity). We are in- 
terested in calculating the Singleton upper bound on the 
rate of reliable (L = 0) transmission of quantum informa- 
tion through a noisy quantum channel, knowing that a 
classical side channel can be used simultaneously for for- 
ward communication only. In particular, we aim at ana- 
lyzing how our quantum information-theoretic formalism 
accounts for the property that such a classical (one-way) 
communication channel does not increase the quantum 
capacity of the noisy channel . 

A. Entropic treatment of the channel 

We first consider the problem in the language of quan- 
tum codes, following closely Ref. jl5|]. As explained ear- 
lier, the result will then be immediately applicable to 
channels with an error model where the fraction of er- 
rors or erasures is bounded. Also, the expression of a 
"probabilistic" joint channel as a binomial superposition 
of underlying channels will then yield the corresponding 



bounds for the quantum erasure or depolarizing chan- 
nel. We start with a "logical" system L (logical words), 
which is initially entangled with the reference R (the ini- 
tial mutual entropy being 2k, so that k arbitrary qubits 
are transmitted). As before, the system RL is initially in 
a pure entangled state \^b.l)- We assume that the en- 
coding operation on L includes a partial measurement, so 
that the encoder has a quantum and a classical output, 
as shown in Fig. |^. 

FIG. 5. Schematic representation of the "encoder" per- 
forming a unitary transformation on the "logical" system L 
(initially entangled with R with a mutual entropy 2k). The 
outputs are: the n "physical" qubits Q, the classical bits C, 
and the "precursor" P ("microscopic" classical bits, before 
amplification). Each classical bit can be thought of as a set 
of qubits that are classically correlated when tracing over P, 
appearing then as a collective classical variable (see |pjf ). 
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Note that the encoding process can be described in 
terms of a unitary transformation applied on L (supple- 
mented with an ancilla initially in a |0) state) without any 
projection operators, so that the classical output is sim- 
ply described as a mixture of orthogonal quantum states. 
More precisely, we have a quantum output Q (consisting 
of n "physical" qubits), P, which is the "precursor" of 
the classical variable, and C , which represents the am- 
plified classical variable. In general, C is a discrete vari- 
able of arbitrary dimension, but we often use the term 
classical bit (two-dimensional variable) in the following. 
Each classical bit C can be thought of as a set of qubits 
that become classically correlated when tracing over the 
corresponding "precursor" , and can then be viewed as a 
collective variable. The resulting amplification gives to 
C the appearance of a classical variable (see p9J ). It is 
convenient to keep the classical output C separate from 
its precursor P for reasons that will appear later. 

To fix the ideas, let us write the intermediate wave 
function of RQP before amplification: 



7 Remember that a standard no-cloning argument to show 
that the rate vanishes at p — 3/8 is based on a classical ma- 
chine that is transmitting an input qubit to one of two outputs 
with probability 1/2, a random qubit being sent on the other 
output, which results in two 50%-depolarizing channels jfjj. 
Clearly, this is the most constraining bound on R that can be 
constructed by use of such a "classical cloning" . 
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(4.1) 



We define the two parameters k and c as 



Here, the \4> P ) form a set of orthogonal states for P, 
and the IV'hq) are orthogonal states of RQ. Eq. (4.1) 
is simply the Schmidt decomposition of I^rqp) divided 
in RQ versus P, and implies S(RQ) = S(P) before am- 
plification. It is important to note that the orthogonal 
states \4> % p) for P correspond to the classical informa- 
tion that will be amplified (the amplification of P will 
be performed in this basis), the clas sical variable being 
distributed according to pi. Eq. (4.1) is the most general 
expression of the output of the encoder (before ampli- 
fication of P) if we require that its overall operation is 
unitary (the joint state of RQP must be pure). We also 
have obviously S(R:QP) = S(R:L) = 2k as a result of 
the conservation of mutual entropy under a local unitary 
operation on both subsystems |0| . 

The amplification of the precursor (symbolically repre- 
sented by the XOR gates inside the encoder) in the \<j> P ) 
basis gives rise to a total wave function for the system 
RQPC (at the output of the encoder) of the form: 



I* 



RQPC, 



= J2VPiWrq)®\0p)®\0c) (4-2) 



where the \<j> 1 ) correspond to an orthogonal set of states 
for both P and C. The classical information in P has 
been "amplified" , so that the precursor P and the collec- 
tive set of qubits C are fully classically correlated when 
tracing over the remaining variables. The entropy of the 
classical channel is S(C) = S(P) = H[pi\. In fact P and 
C are interchangeable, but we need to keep them sepa- 
rate to account for the fact that amplifying the classical 
bits, i.e., tracing over the precursor P, results in a mixed 
state for the systems R, Q, and C. The density matrix 
for the system RQC is given by 



Prqc = ^kI^qX^qI ® \4>c)(<l>c 



(4.3) 



This can be viewed as a classical mixture of orthogonal 
states of RQ and C. Thus, conditionally on the classical 
bits C, the system RQ is in a pure (generally entangled) 
state \ippq}- The question now will be, roughly speaking, 
to determine under which circumstances the quantum 
output Q (possibly altered by decoherence or partially 
erased) retains the full entanglement with R when it is 
supplemented with the classical information C . In other 
words, the question will be whether decoding using C al- 
lows to perfectly recover the logical words (and is more 
effective than in the absence of classical information) . 



S(R) = k 

S(C) = S(P) = S(CP) = c 



(4.4) 
(4.5) 



where the second equation is due to the fact that C and 
P are fully classically correlated, i.e, S(C:P) = c. The 
parameter c = H[pi] simply represents the Shannon en- 
tropy processed by the classical side channel. As before, 
the parameter k stands for the number of logical qubits 
(i.e., fc-qubit arbitrary states are encoded), or, equiva- 
lently, the source entropy when considering a quantum 
channel. 

A necessary condition for the global (classical + quan- 
tum) channel to be lossless is clearly that the amplifica- 
tion of P is a lossless process, that is, does not destroy 
the quantum coherence of the logical words. More pre- 
cisely, the constraint we must express is that the channel 
L — > QC (considered as a quantum channel) is lossless, 
that is 



S{R:P) = 



(4.6) 



where P plays the role of an "environment" for this chan- 
nel. This means that, when amplifying the classical bits 
C (ignoring the precursor P) no entanglement with R is 
lost. In other words, the mutual entropy 2k with R is 
entirely found in QC, i.e., S(R:QC) — 2k, so that no 
entanglement with R leaks out in P when tracing over 
P. (This is so even though the joint system QPC is fully 
entangled with R, i.e., S(R:QPC) = 2k.) The condition 
(4.6) implies that 



S(QC) = S(RP) 

= S(R) + S(P) - S{R:P) 

= k + c 



(4.7) 



Thus, the parameters k and c fully determine the ternary 
entropy diagram 8 for R, QC, and P, as shown in Fig. ||. 
Note that C and P are interchangeable, so that we have 
also S(QP) = k + c. 

A fourth parameter is necessary to fully describe the 
entropies of the 4-partite system (R, Q, C, and P): 



S{Q) = s 



(4.8) 



It corresponds to the von Neumann entropy of the quan- 
tum input of the noisy channel (or quantum output of 
the encoder). Grouping C and P, we can also display 
the ternary entropy diagram for R, Q, and CP, using the 



8 The ternary diagram of a tripartite system in a pure state 
is determined in general by three parameters, for example the 
reduced entropy of each of the three components M] . 
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fact that the joint system is in a pure state (cf. Fig. [?]). 
It shows that neither the quantum output Q, nor the 
classical one CP before amplification (i.e., including the 
precursor) is unentangled with R. In short, the situation 
is that P alone is unentangled with R (it can be traced 
over without altering the entanglement with R) , while P 
and C together are entangled with R. Moreover, even 
though the classical output can be amplified without los- 
ing coherence, the classical information C is in general 
necessary (together with Q) in order to recover the ini- 
tial entanglement of L with respect to R, as implied by 
S(R:QC) = 2k. 

FIG. 6. Entropy diagram characterizing the reference R, 
the system QC (quantum and classical output of the en- 
coder), and P (precursor or the classical bits). The condition 
S(R:P) = means that the full mutual entropy 2k with R 
is found in QC and does not leak out when tracing over P, 
so that the amplification is lossless. The two parameters are 
S(R) = k and S(P) = c. 




FIG. 7. Entropy diagram characterizing the reference R, 
the quantum system Q (output of the encoder), and PC 
(the classical output and precursor before amplification). The 
three parameters are S(R) = k, S(Q) = s, and S(CP) = c. 
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We now want to describe the entropic situation after 
amplification of the classical bits, i.e., after tracing over 
the precursor P. The density matrix for the system RQC 
(or equivalently for the system RQP) is given by 

Prqc = 



The system RQ is thus classically correlated with C, and 
we have 



For each value i of the classical bits (occurring with prob- 
ability pi), RQ is in a given pure (generally entangled) 
state \i> l RQ ). Now, using S(QP) = S(QC) = k + c, we 
can show that R is independent of C: 

S(R:C) = S{R) + S(C) - S(RC) 
= S(R) + S(C) - S(QP) 
= k + c- (k + c) = (4.11) 

This means that, if the amplification of the classical bits 
is lossless (that is, cannot result in an irrecoverable lost 
of mutual entanglement with R), then the "amplified" 
classical bits C must be statistically independent of R. 
In other words, no information about the encoded logi- 
cal word is found in the "amplified" classical bits C even 
though C is in general necessary to recover the entangle- 
ment with R. Using S(QC) = S{R) + S(C), the mutual 
entropy between Q and C can be written as 

S(Q:C) - S(Q) + S(C) - S(QC) 
= S(Q) - S(R) 

= s-k (4.12) 

Thus, the quantum output Q is in general not indepen- 
dent of the classical bits C (in contrast with R). This 
simply means that, in general, the encoder can intro- 
duce some extra entanglement between Q and CP, ad- 
ditionally to the initial entanglement 2k between Q and 
R, giving rise to a non-vanishing mutual entropy. How- 
ever, this additional entanglement is useless as far as the 
transmission of quantum information is concerned, and 
we will see below that the interesting situation corre- 
sponds to s = k, in which case S(Q:C) = 0. Finally, 
it is easy to see that RQC is in a mixed state of en- 
tropy S(RQC) = S(P) = c. The la tter co nditio n, to- 
gether with Eqs. (fh4|), (p|, (Q, ( pil| ), ( pLlll ), and 
( 4.12| ), fully describes the entropies of the tripartite sys- 
tem RQC after tracing over P (i.e., after amplification). 
The corresponding ternary entropy diagram is presented 
in Fig. ^. 

FIG. 8. Entropy diagram characterizing the reference R, 
the quantum system Q (output of the encoder), and the clas- 
sical output C after amplification. The joint system is in a 
mixed state of entropy S(RQC) = c. Note that R is indepen- 
dent of C (i.e., C contains no information about the encoded 
logical word), while RQ is (fully) classically correlated with 
C. 



S(RQ:C) = S(RQ) = S(C) = c 



(4.10) 
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In summary, we have S(R) — fc, S(Q) — s, S(C) = c, 
S(RQ) = c, S(RC) = k + c, S{QC) = k + c, and 
S(RQC) — c, which implies that 

S{R:Q:C) = S(R) + S{Q) + S(C) - S(RQ) 
-S{RC) - S(QC) + S(RQC) 
= s-k-c (4.13) 

As visible from this diagram, the system RQ is in a pure 
entangled state conditionally on the amplified classical 
variable C, i.e., S(RQ\C) = 0, with a characteristic di- 
agram (— fc, 2fc,—fc). Also, RQ is classically correlated 
with C, with a diagram (0, c, 0), so that the simultane- 
ous knowledge of R and Q yields C, i.e., S(C\RQ) = 0. 
Finally, it is easy to check that the diagram for R vs. C 
is (fc, 0, c), i.e., C is independent of R. 

Several inequalities relating the parameters fc, c, and 
s, must be satisfied. First, the subadditivity of entropies 
implies that 

S(R:Q) = k + s-c>0 (4.14) 
S(Q:C) = s - k > (4.15) 

Similarly, the strong subadditivity of entropies implies 
that 

S{R:C\Q) = fc + c- s > (4.16) 

These inequalities can be summarized as 

0<s-k<c<s + k (4.17) 

implying namely that < c < 2s. The two limiting cases 
are (i) c = and s = fc, which corresponds to a quan- 
tum channel without a classical side channel; and (ii) 
c = 2s = 2k, which corresponds to the situation where 
the entropy of the classical channel is maximum. In the 
following, we will focus on case (ii), that is, when a max- 
imum amount of classical information is processed, since 
it is supposedly the case where the classical side channel 
might help the transmission of quantum information the 
most. 

FIG. 9. Entropy diagrams before amplification of the clas- 
sical bits in the case where the processed classical information 
is maximum (c = 2s = 2k). As before, S(R:P) = 0, so that 
the amplification of the precursor P does not affect quantum 
coherence. Note that S(R:Q) = 0, so that the the quantum 
output Q can be erased if no amplification is performed. 
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FIG. 10. Entropy diagram after amplification of the classi- 
cal bits in the case where the processed classical information 
is maximum (c = 2s = 2k). Here, R and Q play the same role. 
The classical variable C contains the information about which 
entangled state RQ is in, while R or Q alone is independent 
of C. This diagram with k = 1 plays a crucial role in the 
entropic description of superdense coding and teleportation, 
as shown elsewhere |io| |, 
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We display in Figs. || and the entropy diagrams 
corresponding to the limiting case c = 2k = 2s. Note 
that S(R:Q) = as shown in Fig. |[ so that the chan- 
nel L — > CP is lossless. Thus, the entire entanglement 
with R is retained in the unamplified classical variable 
CP. We will discuss this below. Fig. [n] implies that 
S(R:C) = S(Q:C) — 0, i.e., Q and R are both indepen- 
dent of the classical variable C. This emphasizes the fact 
that R and Q play exactly the same role in this limit- 
ing case (ii). The peculiar feature here is that Q and 
R together are fully correlated with C, according to the 
diagram (0, 2k, 0), although Q or R taken separately is 
independent of C. In other words, the classical variable 
C contains the information about which entangled state 
RQ is in (i.e., the mutual entropy 2k) while it contains 
no information about Q or R alone. 

The diagram in Fig. ^| plays an important role in 
the information-theoretic description of quantum tele- 
portation and superdense coding, as shown in a further 
work J|o| . In the special case where k = 1 (no block cod- 
ing is used), it describes quantum teleportation in the 
following sense: Q is the particle that is initially sent to 
Bob (one half of the Bell state shared with Alice), and 
C are the 2 classical bits that Alice sends to Bob. If the 
teleported particle (here L) is initially entangled with R 
so that RL is in a Bell state, then Q ends up in an en- 
tangled state with R which is one of the four Bell states, 
conditionally on C. (Teleportation is completed by hav- 
ing Bob applying a unitary transformation on Q that is 
specified by C.) Thus, the 2 classical bits code for one 
of the four Bell states, and the corresponding diagram is 
shown in Fig. [l(] with fc = 1. The amplification of the 
classical bits by Alice does not destroy coherence, since 
we have S(R:P) = 0, and bringing Q and C together 
yields the initial entanglement with respect to R, i.e., 
S(R:QC) — 2. The non-classical feature here is that the 
latter equation can be satisfied even though S(R:Q) = 
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holds at the same time, that is, the particle that Bob re- 
ceived initially is independent of R (this must be true as a 
consequence of causality). Thus, the entire entanglement 
with R is carried by C, as reflected by S(R:C\Q) = 2. 
This will be discussed in more details elsewhere [ |30| . 

Note finally that the vanishing mutual entropy between 
R and Q implies that no entanglement with R is found in 
Q alone. Therefore, the quantum output Q can be erased 
without loosing the entanglement with R, provided that 
the classical variable is not amplified (by keeping CP). 
In other words, since S(R:Q) — 0, the knowledge of the 
classical bits C (along with the precursor P) is sufficient 
to recover L, even in the absence of Q. For example, 
in teleportation, the unamplified classical bits alone are 
enough to teleport an arbitrary state, so that the qubit 
Q that Bob received initially can be erased. (Of course, 
this is unrealistic, since one never access to all the mi- 
croscopic degrees of freedom making the classical bits. 
Tracing over one of them is enough to loose the quantum 
information if Q is erased.) 



B. Singleton bound on a quantum channel with a 
classical side channel 



Let us now repeat the reasoning which results in the 
Singleton bound on a quantum code (Section III), but 
taking into account the classical auxiliary channel. Thus, 
we assume again that the quantum output Q of n qubits 
is partitioned into an erased piece Q e (of e qubits) and 
an unerased one Q u (of n — e qubits). We are seeking for 
a necessary condition for the possibility of recovering the 
erasure of Q e when the decoder has access to the clas- 
sical information C (so the decoding operation can be 
conditional on C). As before, we consider two different 
partitions of Q (see Fig. [|), and express a lower bound on 
the entropy of the "overlap" Q* . The basic constraints 
(which must be satisfied simultaneously) are 



S(R:Q e P) = 
S{R:Q' e P) = 



(4.18) 
(4.19) 



expressing the fact the no entanglement (with respect to 
R) is lost when amplifying the classical bits and erasing 
Q e (or Q' e ). Equivalently, the full initial entanglement of 
L must be "squeezed" into Q u (or Q' u ) and C: 



S{R:Q U C) = S(R:QC) = 2S(R) 
S{R:Q' U C) = S(R:QC) = 2S(R) 



(4.20) 
(4.21) 



In other words, the knowledge of the unerased part Q u 
(or Q' u ) is sufficient to reconstruct (using C) the initial 
logical word. Since the system RQ u Q e PC is in a pure 
state, we have 



where we used the fact that P and C are interchangeable. 
Now, dividing Q u into Q' e and Q* , we can write an upper 
bound on the mutual entropy between R and Q U C ', 



S(R:Q U C) = S{R) 
= S(R) 
<S(R) 



S{Q U C) - S{RQ U C) 
S(Q' e Q*C) - S(Q e C) 
S{Q' e C) + S(Q*) - S(Q e C) 



(4.23) 



where we have used the subadditivity of quantum en- 
tropies. Eqs. (|4.20| ), ( |4.2l| ), ( [4.23|) , and its counterpart 
(when Q u is replaced by Q' u ) thus give 

S{R) - S{Q*) < S{Q' e C) - S{Q e C) (4.24) 
S{R)-S{Q*) < S(Q e C) - S(Q' e C) (4.25) 

Combining these two last inequalities results in the same 
inequality as in the case where no classical auxiliary chan- 
nel is used: 



S(R) < S{Q*) <n-2e 



(4.26) 



Therefore, we obtain the same quantum Singleton bound 
for a quantum code supplemented with a noiseless clas- 
sical channel as in the absence of such a channel: 



k < n — 2e 



(4.27) 



In other words, the classical side channel (transmitting 
data with an entropy up to twice the quantum source 
entropy k) does not increase the Singleton bound on the 
maximum attainable distance for quantum codes. 

This result can be immediately applied to a quantum 
channel characterized by a p-bounded fraction of erasures 
(cf. Section IIIA) and supplemented with an auxiliary 
classical channel, since, in that case, the use of a ((n, k)) 
code protecting for e = np erasures is enough to guaran- 
tee reliable transmission. Therefore, the upper bound on 
the rate is given by 



R < 1 - 2p 



(4.28) 



similar to Eq. (3.1), confirming the fact that the classical 
side channel does not enhance the quantum information 
transmission through the quantum channel || . 

In the case of a quantum erasure channel (with era- 
sure probability p) supplemented with a classical chan- 
nel, the entire reasoning of Section IIIC can be repeated, 
the only difference being that one has to calculate the 
sum of [S(Q U C) — S(Q e C)\ for two overlapping channels 
(c and c'): 



S(Q U C) - S(Q e C) 

< 2S(Q*) 

< 2(n - 2e) 



S(Q' U C) ~ S(Q' e C) 



(4.29) 



S(RQ U C) = S(Q e P) = S{Q e C) 



The resulting bound on the mutual entropy is thus the 



(4.22) same as Eq. (3.20), so that we have the same upper bound 
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on the rate of reliable transmission of quantum informa- 
tion: 

R<l — 2p (4.30) 

Finally, the cases of a channel with a p-bounded frac- 
tion of errors and the quantum depolarizing channel can 
be treated exactly as in Sections IIIB and HID, so that 
the classical side channel does not modify the Singleton 
upper bound on the rate in both cases. 

V. CONCLUSION 

The search for better bounds on the capacity of quan- 
tum channels such as the depolarizing channel is still a 
major endeavor in quantum information theory today. 
Clearly, entropic considerations alone do not suffice to 
prove that a reliable quantum coding scheme exists with 
achieves a transmission rate arbitrarily close to the ca- 
pacity. As a matter of fact, a similar situation prevails for 
classical channels as well. Nevertheless, an entropic ap- 
proach appears to be helpful in order to derive bounds on 
the capacity of classical or quantum channels from similar 
principles, and to analyze classical and quantum commu- 
nication in a unified framework, as shown in this work. 
More generally, the leading idea underlying the approach 
to quantum information presented in this work and in 
Refs. [p|— |l3[| is to build a theory that extends Shannon's 
concepts to the quantum regime. Rather than attempt- 
ing to define a distinct (purely quantum) information the- 
ory that would apply to the transmission of quantum 
states only, we prefer to consider an extended Shannon 
theory, which should account for the processing of clas- 
sical as well as quantum information (arbitrary classical 
or quantum states). After all, any classical information- 
processing system should be describable, in principle, in 
terms of its underlying quantum mechanical degrees of 
freedom. In this sense, Shannon theory should simply 
be viewed as a special case of a more general theory of 
information in quantum mechanics that remains to be 
built. 

The central characteristics of a quantum extension of 
Shannon theory happens to be that negative von Neu- 
mann conditional entropies must be used in order to 
encompass quantum entanglement in an information- 
theoretic description. It is pointed out in Ref. |l(J that 
this apparently innocent observation should be viewed 
as the central novelty of a quantum mechanical exten- 
sion of Shannon theory beyond its original range. Since 
most of the classical concepts of Shannon theory have 
a straightforward quantum analog, it is possible to re- 
peat a great part of the classical reasoning and apply it 
to quantum information processes, as shown in this pa- 
per and our previous work. More specifically, we have 
shown here that an information-theoretic description of 



noisy quantum channels following closely Shannon theory 
provides insight into the derivation of entropic bounds 
on the quantum capacity (the maximum rate at which 
quantum information can be reliably processed in spite 
of the noise). Namely, the entropic Singleton bound on 
quantum error-correcting codes |0| can be used in order 
to investigate standard quantum channels such as the 
quantum erasure channel or the quantum depolarizing 
channel. The same formalism can be extended in order 
to account for an auxiliary classical channel used for for- 
ward communication besides the noisy quantum channel. 
Entropic Singleton bounds can be derived in the latter 
case too, showing that the classical channel does not en- 
hance the quantum capacity (or an upper bound on it), 
in agreement with what was proven in Ref. ||. 

The central part of the reasoning consists in calculat- 
ing a lower bound on the average loss of the channel (i.e., 
the loss of the joint channel made of n consecutive uses 
of a memoryless channel, divided by the number of pro- 
cessed symbols n — ► oo) which characterizes the "qual- 
ity" of the transmission. If the use of block coding makes 
the joint (n-bit) channel lossless (i.e., the average loss is 
zero) , then reliable transmission of quantum information 
is achievable. This is true even though the one-symbol 
loss (for a single use of the channel) is non-zero, reflect- 
ing the alteration due to noise in each use of the channel. 
Perfect transmission by block coding is thus possible pro- 
vided that this lower bound on the average loss is zero or 
less, which results in an upper bound on the attainable 
rate. 

Obviously, there remains much to be done in order 
to derive better bounds on the rate (or perhaps the ex- 
act capacity) using such an entropic approach. We have 
made a progress in this direction, as illustrated by the 
entropic Singleton bound on the capacity of the quan- 
tum erasure channel (C < 1 — 2p) which happens to be 
the exact capacity calculated in |Q. For the quantum 
depolarizing channel, however, we obtain a well-known 
bound on the capacity (C < 1 — 8p/3, seejql) which has 
been recently shown not to be attainable [E7| . Neverthe- 
less, the characterization of the exact quantum capacity 
of the depolarizing channel is still an open problem, and 
it is possible that the entropic approach presented here 
could be further improved. Also, the issue of the attain- 
able capacity of a general noisy quantum channel might 
be explored along the same lines. The search for bet- 
ter entropic bounds on the capacity of quantum channels 
will be the subject of future work. 
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APPENDIX A: INFORMATION-THEORETIC 
CHARACTERIZATION OF A NOISY CLASSICAL 
CHANNEL 

In this Appendix, we outline the information-theoretic 
description of noisy classical channels, for the sake of 
clarifying the correspondence with the treatment of noisy 
quantum channels used throughout this paper. At first 
sight, a classical channel seems very different from a 
quantum channel as no classical "reference" is used to 
purify the input. Also, the classical input-output joint 
probability distribution has no quantum equivalent, since 
there is no joint state for the initial quantum system Q 
and the final system Q' (it is the same system). However, 
a classical channel can be thought of as a device which 
processes classical correlation (with respect to a reference 
R). If the input X is initially fully correlated with a ref- 
erence R, then the residual mutual entropy between the 
output Y and R is a measure of the amount of correlation 
(or information) transmitted through the channel. For a 
quantum channel, we consider the processing of entan- 
glement (with respect to R) rather than correlation, so 
that the residual mutual entropy between the decohered 
quantum system Q' (the quantum output) and R is the 
interesting quantity. This makes the classical-quantum 
correspondence easier to understand. 

A noisy classical channel with input X and output Y 
is characterized by 



I = H{X:Y) 
L = H{X\Y) 
N = H{Y\X) 



(Al) 
(A2) 
(A3) 



where /, L, and N denote the information, the loss, and 
the noise, respectively (see, e.g., @]). Information pro- 
cessing through the channel is measured by the balance 
between / and L, these two quantities summing to the 
source entropy: 



H(X) 



(A4) 



The loss measures the inherent uncertainty in the pro- 
cess of inferring the input of the channel from the altered 
output (decoding), that is, the entropy of the input con- 
ditional on the output. When the loss L is zero (lossless 
channel), the information / is maximum so that classical 
information is perfectly transmitted through the channel. 
Conversely, when 1 = (and L is maximum), no classi- 
cal information is processed by the channel. The noise N 
reflects the uncertainty of the output symbol for a given 
input symbol, and is irrelevant as far as the transmission 
of information is concerned (it corresponds to the loss of 



the reverse channel where the input and output are inter- 
changed). A channel with N = is called deterministic, 
and a channel which is both deterministic and lossless is 
named noiseless. We follow the same nomenclature for 
quantum channels in this paper. 

If n independent channels arc used in parallel with 
X\ ■ ■ ■ X n as an input string and Y\ ■ ■ ■ Y n as an output 
string, it can be shown that the information and the loss 
are subadditive: 



I < Il- 
ls < Li 



In 

' L n 



(A5) 
(A6) 



where I (L) is the information (loss) of the joint (n- 
symbol) channel, while i, (Li) is the information (loss) 
of the i-th individual channel Xi — > Y±. Property flAg) 
results from the subadditivity of Shannon entropies 



H(Yi---Y n ) <H(Y{) 



H(Y n ) 



(A7) 



and from the fact that the channels are independent (each 
Yi depends on Xi only) 

I = H(Xi---X n :Yi---Y n ) 
= H(Yi---Y n )-H(Y x ---Y n \Xi 



< 



< 



i=l 



-H(Y i \Yi---Y i _i,X 1 ---X n ) 



[H(Y) 

J2[h(y 1 )-h(y 1 \x 1 )} 



(A8) 



Property (A6) is an immediate consequence of the sub- 
additivity of Shannon conditional entropies 



H(X 1 ---X n \Y 1 ---Y n ) <H(Xi\Yi) 



H(X n \Y n ) 
(A9) 



Since the information and the loss of each individual 
(one-symbol) channel sum to the source entropy of that 
channel 



I i + L i = H(X i ) 



(A10) 



the allowed range for the overall loss of the joint channel 
is 



Li 



L n -M < L < Li 



L r , 



(All) 



with M = H(Xi) + ■■■ + H(X n ) - H(Xi ■ ■ ■ X n ) > 0. 
Consequently, the loss cannot increase by using block 
coding (i.e., using parallel channels with correlated in- 
put symbols), but it can decrease by an amount which 
is bounded by M . Note that M vanishes when the in- 
put symbols are independent, while a positive value of 
M reflects the correlation between the input symbols. 

It is simple to obtain a necessary condition for perfect 
transmission (i.e., with a vanishing overall loss) by block 
coding through a noisy channel. Clearly the condition 
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H{X 1 ---X n )<H{X 1 ) 



H(X n ) — Li — L n 

(A12) 



must be fulfilled for the lower bound on L to extend to 
zero. Therefore, the rate of transmission through the 
joint channel, R = H(X\ ■■■X n )/n is bounded from 
above by the averaged one-symbol information of the in- 
dividual channels: 



R < 



h 



In 



(A13) 



This is related to the weak converse of Shannon's noisy 
coding theorem [fl7|| : the transmission cannot be per- 
fect (or lossless, i.e., L = 0) if the rate of transmission 
exceeds the (averaged) mutual Shannon entropy charac- 
terizing each use of the channel. This is consistent with 
Shannon's result that the classical channel capacity is 
the maximum (over all input distributions) of the mu- 
tual information between channel input and output for a 
single use of the channel. Note that entropy considera- 
tions alone do not suffice to prove that a reliable coding 
scheme exists with achieves a transmission rate arbitrar- 
ily close to the capacity. A similar situation is found for 
quantum channels as well, as shown throughout this pa- 
per. Still, an entropic approach is very helpful in order 
to derive bounds on classical or quantum channels from 
basic principles, and to analyze classical and quantum 
communication in a unified manner. 
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